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4 <110> APPLICANT: STEER, Brian 

5 CALLEN, Walter 

6 HEALEY, Shaun 

7 PULLI AM , Derrick 

9 <120> TITLE OF INVENTION: GLUCANASES, NUCLEIC ACIDS ENCODING THEM AND METHODS 

10 FOR MAKING AND USING THEM 

12 <130> FILE REFERENCE: 564462009500 
C--> 14 <140> CURRENT APPLICATION NUMBER: US/10/560,957 
C--> 15 <141> CURRENT FILING DATE: 2005-12-14 

17 <150> PRIOR APPLICATION NUMBER: PCT/US2004/021492 

18 <151> PRIOR FILING DATE: 2004-07-02 

20 <150> PRIOR APPLICATION NUMBER: 60/484,725 

21 <151> PRIOR FILING DATE: 2003-07-02 
23 <160> NUMBER OF SEQ ID NOS : 518 
25 <170> SOFTWARE: Patent In version 3.1 

27 <210> SEQ ID NO: 1 

28 <211> LENGTH: 1035 
2 9 <212> TYPE: DNA 
30 <213> ORGANISM: Unknown 

32 <220> FEATURE: 

33 <223> OTHER INFORMATION: Obtained from an environmental sample. 
35 <400> SEQUENCE: 1 



36 atgagagata taagtcctgc agagctggtt gccgagatga caaccggatg gaatcttgga 60 

37 aatacctttg atgcatatgg aaaaggcggt cttgatgatg agacaggctg gggaaatccc 120 

38 tatactacta aggaaatgat tgatgtagtc tgtgaaaagg ggtttaattc tatcagaatc 180 

39 ccaataacct gggctgatca tatgggtgct gctcctgact atacagtaga tgaggactgg 240 

40 atgaaccgtg tagaagaggt tgtaaattat gctcttgatg acgggatgta tgtcattatc 300 

41 aattcccacc acgaagagtc ctggagaatc cctgatgatg cacacattga tgcagtagat 360 

42 gaacaggttg gaaagctctg ggtccagata gctgagaggt tcagggatta tggcgaccat 42 0 

43 cttatttttg aggggctaaa tgagccacgt gttaagggcg gtgaaaatga gtggaatggc 480 

44 ggaacgaccg aaggacgtaa atgcctggac agacttaatc agacttttgt agattcagta 540 

45 agatcaacag gtggaaataa tgaaaaaaga cttgtactta taacaagctt tgcatcctca 600 

46 cacgtaatac agacaatagg aagccttaaa attccaagcg acgatcacct tgctgtttca 660 

47 atccatgcct atacgcctta tgattttaca tatgcctccg gcacctcctc tgagctttta 720 

48 acctgggatg gttccagaaa aagtgatatt gcttctgtta ttggtgatgt aaaaagaatc 780 

49 tttatagaca agggtattcc tgtccttatg acagaatatg gtgcagttga taaagatggc 840 

50 aactccggtg atgtaagcgc ctgggtaact gagtatttaa cacgcgcaaa aaaagccggt 900 

51 atcccatgct tttggtggga caatggcctg tatgaatcag gtgatgaaca ttttgctata 960 

52 ttcaaccgca atgacctgac ctggtacaga gaagacgtcg ttgatgccat tatggctgtc 1020 

53 tactatgccc aataa 1035 

55 <210> SEQ ID NO: 2 

56 <211> LENGTH: 344 

57 <212> TYPE: PRT 
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58 <213> ORGANISM: Unknown 

60 <220> FEATURE: 

61 <223> OTHER INFORMATION: Obtained from an environmental sample. 
63 <400> SEQUENCE: 2 



64 


Met 


Arg 


Asp 


He 


Ser 


Pro 


Ala 


Glu 


Leu 


Val 


Ala 


Glu 


Met 


Thr 


Thr 


Gly 


65 


1 
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10 










15 




66 


Trp 


Asn 


Leu 


Gly 


Asn 


Thr 


Phe 


Asp 


Ala 


Tyr 


Gly 


Lys 


Gly 


Gly 


Leu 


Asp 


b / 








2 0 










25 










30 






68 


Asp 


Glu 


Thr 


Gly 


Trp 


Gly 


Asn 


Pro 


Tyr 


Thr 


Thr 


Lys 


Glu 


Met 


He 


Asp 


69 






35 










40 










45 








70 


Val 


Val 


Cys 


Glu 


Lys 


Gly 


Phe 


Asn 


Ser 


He 


Arg 


He 


Pro 


He 


Thr 


Trp 


71 




50 










55 










60 










72 


Ala 


Asp 


His 


Met 


Gly 


Ala 


Ala 


Pro 


Asp 


Tyr 


Thr 


Val 


Asp 


Glu 


Asp 


Trp 


73 


65 










70 










75 










80 


74 


Met 


Asn 


Arg 


Val 


Glu 


Glu 


Val 


Val 


Asn 


Tyr 


Ala 


Leu 


Asp 


Asp 


Gly 


Met 


75 










85 










90 










95 




76 


Tyr 


Val 


He 


He 


Asn 


Ser 


His 


His 


Glu 


Glu 


Ser 


Trp 


Arg 


He 


Pro 


Asp 


77 








100 










105 










110 






78 


Asp 


Ala 


His 


He 


Asp 


Ala 


Val 


Asp 


Glu 


Gin 


Val 


Gly 


Lys 


Leu 


Trp 


Val 


79 






115 










120 










125 








80 


Gin 


He 


Ala 


Glu 


Arg 


Phe 


Arg 


Asp 


Tyr 


Gly 


Asp 


His 


Leu 


He 


Phe 


Glu 


81 




130 










135 










140 










82 


Gly Leu 


Asn 


Glu 


Pro 


Arg 


Val 


Lys 


Gly 


Gly 


Glu 


Asn 


Glu 


Trp 


Asn 


Gly 


83 


145 










150 










155 










160 


84 


Gly 


Thr 


Thr 


Glu 


Gly 


Arg 


Lys 


Cys 


Leu 


Asp 


Arg 


Leu 


Asn 


Gin 


Thr 


Phe 


85 










165 










170 










175 




86 


Val 


Asp 


Ser 


Val 


Arg 


Ser 


Thr 


Gly 


Gly 


Asn 


Asn 


Glu 


Lys 


Arg 


Leu 


Val 


87 








180 










185 










190 






88 


Leu 


He 


Thr 


Ser 


Phe 


Ala 


Ser 


Ser 


His 


Val 


He 


Gin 


Thr 


He 


Gly 


Ser 


89 






195 










200 










205 








90 


Leu 


Lys 


He 


Pro 


Ser 


Asp 


Asp 


His 


Leu 


Ala 


Val 


Ser 


He 


His 


Ala 


Tyr 


91 




210 










215 










220 










92 


Thr 


Pro 


Tyr 


Asp 


Phe 


Thr 


Tyr 


Ala 


Ser 


Gly 


Thr 


Ser 


Ser 


Glu 


Leu 


Leu 


93 


225 










230 










235 










240 


94 


Thr 


Trp 


Asp 


Gly 


Ser 


Arg 


Lys 


Ser 


Asp 


He 


Ala 


Ser 


Val 


He 


Gly 


Asp 


95 










245 










250 










255 




96 


Val 


Lys 


Arg 


He 


Phe 


He 


Asp 


Lys 


Gly 


He 


Pro 


Val 


Leu 


Met 


Thr 


Glu 


97 








260 










265 










270 






98 


Tyr 


Gly 


Ala 


Val 


Asp 


Lys 


Asp 


Gly 


Asn 


Ser 


Gly 


Asp 


Val 


Ser 


Ala 


Trp 


99 






275 










280 










285 









100 Val Thr Glu Tyr Leu Thr Arg Ala Lys Lys Ala Gly He Pro Cys Phe 

101 290 295 300 

102 Trp Trp Asp Asn Gly Leu Tyr Glu Ser Gly Asp Glu His Phe Ala He 

103 305 310 315 320 

104 Phe Asn Arg Asn Asp Leu Thr Trp Tyr Arg Glu Asp Val Val Asp Ala 

105 325 330 335 

106 He Met Ala Val Tyr Tyr Ala Gin 

107 340 

109 <210> SEQ ID NO: 3 o 
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110 <211> LENGTH: 1122 

111 <212> TYPE: DNA 

112 <213> ORGANISM: Unknown 

114 <220> FEATURE: 

115 <223> OTHER INFORMATION: Obtained from an environmental sample. 

117 <400> SEQUENCE: 3 

118 gctgacctgc ggcgacgtcg 

119 tgcagtgcgc aggccgcgcc 

12 0 agcctgagcc gcgacggacg 

121 gaaggccagt cctatgcgtt 

122 cgcatcctgg gctggaccca 

123 gcctggttgt gggggcggga 

124 tcggactccg atctctggct 

125 cccgcgttga aggccatcgc 

126 gacctgcccg ggctcggccc 

127 gcgacgcggg tcaaccccag 

128 gatcgcagcg ggccgtggca 

129 agcccaaagg ggttcgcacc 

13 0 .ccggtgcgcg gcgcggtcgg 

131 accgcgccgc gcgatgcgtt 

132 cgcctgcgca gtggtgcgcc 

133 ggcgaaggca actacggatt 

134 gcggagcgcg cgcagtcgct 

135 gcaccggcct actactcgca 

136 tggcgtttcg ccgccgacgg 

138 <210> SEQ ID NO: 4 

139 <211> LENGTH: 373 

140 <212> TYPE: PRT 

141 <213> ORGANISM: Unknown 

143 <220> FEATURE: 

144 <223> OTHER INFORMATION: Obtained from an environmental sample. 

146 <220> FEATURE: 

147 <221> NAME/KEY: SIGNAL 

148 <222> LOCATION: (1) . . . (25) 
150 <400> SEQUENCE: 4 



151 


Ala 


Asp 


Leu 


Arg 


Arg 


Arg 


Arg 


"'Leu 


Leu 


Gin Ala Ala 


Ala 


Thr Leu 


Pro 


152 


1 








5 










10 




15 




153 


Leu 


Leu 


Gly 


Trp 


Cys 


Ser 


Ala 


Gin 


Ala 


Ala Pro Ala 


Pro 


Arg Trp 


Pro 


154 








20 










25 






30 




155 


Ala 


Trp 


Gin 


Val 


Leu 


Leu 


Asp 


Ser 


Ser 


Leu Ser Arg 


Asp 


Gly Arg 


Met 


156 






35 










40 






45 






157 


He 


Asp 


Arg 


Ser 


Gin 


Asp 


Asp 


Gin 


Arg 


Ser Thr Ser 


Glu 


Gly Gin 


Ser 


158 




50 










55 






60 








159 


Tyr 


Ala 


Leu 


Phe 


Phe 


Ala 


Leu 


Val 


Asp 


Asn Asp Gin 


Ala 


Leu Phe 


Asp 


160 


65 










70 








75 






80 


161 


Arg 


He 


Leu 


Gly 


Trp 


Thr 


Gin 


Asp 


Asn 


Leu Ala Gly Gly 


Asp Met 


Arg 


162 










85 










90 




95 




163 


Gin 


His 


Leu 


Pro 


Ala 


Trp 


Leu 


Trp 


Gly Arg Asp Asp 


Lys 


Gly Ser 


Trp 


164 








100 










105 






110 





cctgctgcag gcggccgcca cgctgccgct gcttggctgg 60 

cgcaccgcgc tggccggcgt ggcaggtact gctcgacagc 12 0 

catgatcgat cgcagccagg acgatcagcg cagcacctcc 180 

gttcttcgcc ctggtcgaca acgaccaggc gctgttcgac 240 

ggacaacctc gccggtggcg acatgcgcca gcacctgccc 300 

tgacaaggga agctggcggg tactggacga caacccggcc 360 

ggcctatgcg ctgctggaag gcgcgaggct gtggcgccgt 420 

cgagggcctg ctcgcgcagg tgcgggcacg cgagatcgtg 480 

gatgctgttg ccgggcccac agggattcac cgaaggggac 540 

ctatctgcca ctgccgctgc tgcgccgctt tgcggtggag 600 

ggcgctggcg cgtaacagcg tgcagctgtt gcagcagacc 660 

ggactgggcg gcgtggaaag gcgatcgttt cgtcgtcgat 72 0 

cagctacgat gcgatccgct gctacacctg ggcaggcatg 780 

gttccgcacg cagctggcgg ccctgtccgg gccgctgcag 840 

gatgtgggaa aaaatcgaca cccgcagtgg ccaggggcag 900 

ccgcgcggca ctgctgccgt atctgatcgc gcagggcgac 960 

gcgggccagc ctgcccagcg ccgaacagca gcgtgctgat 102 0 

gatgctggcc ctgttcggcc tgggctgggc cgaagggcgc ' 1080 

ccgcctgcag ccgcgctggt ag 1122 
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165 Arg Val Leu Asp Asp Asn Pro Ala Ser Asp Ser Asp Leu Trp Leu Ala 

166 115 120 125 

167 Tyr Ala Leu Leu Glu Gly Ala Arg Leu Trp Arg Arg Pro Ala Leu Lys 

168 130 135 140 

169 Ala He Ala Glu Gly Leu Leu Ala Gin Val Arg Ala Arg Glu He Val 

170 145 150 155 160 

171 Asp Leu Pro Gly Leu Gly Pro Met Leu Leu Pro Gly Pro Gin. Gly Phe 

172 165 170 175 

173 Thr Glu Gly Asp Ala Thr Arg Val Asn Pro Ser Tyr Leu Pro Leu Pro 

174 180 185 190 

175 Leu Leu Arg Arg Phe Ala Val Glu Asp Arg Ser Gly Pro Trp Gin Ala 

176 195 200 205 

177 Leu Ala Arg Asn Ser Val Gin Leu Leu Gin Gin Thr Ser Pro Lys Gly 

178 210 215 220 

179 Phe Ala Pro Asp Trp Ala Ala Trp Lys Gly Asp Arg Phe Val Val Asp 

180 225 230 235 240 

181 Pro Val Arg Gly Ala Val Gly Ser Tyr Asp Ala He Arg Cys Tyr Thr 

182 245 250 255 

183 Trp Ala Gly Met Thr Ala -Pro Arg Asp Ala Leu Phe Arg Thr Gin Leu 

184 '260 265 270 

185 Ala Ala Leu Ser Gly Pro Leu Gin Arg Leu Arg Ser Gly Ala Pro Met 

186 275 280 285 

187 Trp Glu Lys He Asp Thr Arg Ser Gly Gin Gly Gin Gly Glu Gly Asn 

188 290 295 300 

189 Tyr Gly Phe Arg Ala Ala Leu Leu Pro Tyr Leu He Ala Gin Gly Asp 

190 305 310 315 320 

191 Ala Glu Arg Ala Gin Ser Leu Arg Ala Ser Leu Pro Ser Ala Glu Gin 

192 325 330 335 

193 Gin Arg Ala Asp Ala Pro Ala Tyr Tyr Ser Gin Met Leu Ala Leu Phe 

194 340 345 350 

195 Gly Leu Gly Trp Ala Glu Gly Arg Trp Arg Phe Ala Ala Asp Gly Arg 

196 355 360 365 

197 Leu Gin Pro Arg Trp 

198 370 

201 <210> SEQ ID NO: 5 

202 <211> LENGTH: 954 
2 03 <2 12 > TYPE : DNA 

204 <213> ORGANISM: Bacteria 

206 <400> SEQUENCE: 5' 

207 atgggtgttg atccttttga aaggaacaaa atattgggaa gaggcattaa tataggaaat 60 
209 gcgcttgaag caccaaatga gggagactgg ggagtggtga taaaagatga gttcttcgac 120 
211 attataaaag aagccggttt ctctcatgtt cgaattccaa taagatggag tacgcacgct 180 
213 tacgcgtttc ctccttataa aatcatggat cgcttcttca aaagagtgga tgaagtgata 240 
215 aacggagccc tgaaaagagg actggctgtt gttataaata ttcatcacta cgaggagtta 3 00 
217 atgaatgatc cagaagaaca caaggaaaga tttcttgctc tttggaaaca aattgctgat 360 
219 cgttataaag actatcccga aactctattt tttgaaattc tgaatgaacc tcacggaaat 420 
221 cttactccgg aaaaatggaa tgaactgctt gaggaagctc taaaagttat aagatcaatt 480 
223 gacaaaaagc acactataat tataggcaca gctgaatggg ggggtatatc tgcccttgaa 540 
225 aaactgtctg tcccaaaatg ggaaaaaaat tctatagtta caattcacta ctacaatcct 600 
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660 
720 
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840 
900 
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O 1 *7 

22 / 


ttcgaattta 


cccatcaagg agctgagtgg gtggaaggat 


ctgagaaatg 1 


gttgggaaga 


2.2.3 


aagtggggat 


ctccagatga tcagaaacat ttgatagaag 


aattcaattt 


tatagaagaa 




tggtcaaaaa 


agaacaaaag accaatttac ataggtgagt 


ttggtgccta 


cagaaaagct 


O 1 ~\ 
2.3 3 


gaccttgaat 


caagaataaa atggacctcc tttgtcgttc 


gcgaaatgga 


gaaaaggaga 


ZOO 


tggagctggg 


catactggga attttgttcc ggttttggtg 


tttatgatac 


tctgagaaaa 


") -3 —t 

2 6 1 


acctggaata 


aagatctttt agaagcttta ataggaggag 


atagcattga 


ataa 




0 A f\ 


<210> SEQ ID NO 


: 6 
























241 


<211> LENGTH: 317 
























OA O 


<212> TYPE: 


PRT 


























z4J 


<213> ORGANISM: 


Bacteria 




















z4b 


<220> FEATURE: 


























246 


<221> NAME /KEY: 


DOMAIN 






















247 


<222> LOCATION: 


(19) 




(296) 




















248 


<223> OTHER 


INFORMATION 


: Cellulase 


(glycosyl hydrolase 


family 5) 


250 


<4 00> SEQUENCE: 


6 
























251 


Met 


Gly 


Val 


Asp 


Pro 


Phe 


Glu 


Arg 


Asn 


Lys 


He 


Leu Gly Arg 


Gly 


He 


252 


1 








5 










10 










15 




255 


Asn 


lie 


Gly Asn 


Ala 


Leu 


Glu 


Ala 


Pro 


Asn 


Glu 


Gly Asp 


Trp 


Gly 


Val 


T C C 

^ bo 








2 0 










25 










30 






259 


Val 


lie 


Lys 


Asp 


Glu 


Phe 


Phe 


Asp 


He 


He 


Lys 


Glu Ala Gly 


Phe 


Ser 


260 






35 










40 










45 








263 


His 


Val 


Arg 


He 


Pro 


He 


Arg 


Trp 


Ser 


Thr 


His 


Ala 


Tyr 


Ala 


Phe 


Pro 


264 




50 










55 










60 










267 


Pro 


Tyr 


Lys 


He 


Met 


Asp 


Arg 


Phe 


Phe 


Lys 


Arg 


Val 


Asp 


Glu 


Val 


He 


zoo 


65 










70 










75 










80 


271 


Asn 


Gly 


Ala 


Leu 


Lys 


Arg 


Gly 


Leu 


Ala 


Val 


Val 


He 


Asn 


He 


His 


His 


272 










85 










90 










95 




275 


Tyr 


Glu 


Glu 


Leu 


Met 


Asn 


Asp 


Pro 


Glu 


Glu 


His 


Lys 


Glu 


Arg 


Phe 


Leu 


276 








100 










105 










110 






279 


Ala 


Leu 


Trp 


Lys 


Gin 


He 


Ala 


Asp 


Arg 


Tyr 


Lys 


Asp 


Tyr 


Pro 


Glu 


Thr 


Z O U 






115 










120 










125 








283 


Leu 


Phe 


Phe 


Glu 


He 


Leu 


Asn 


Glu 


Pro 


His 


Gly Asn 


Leu 


Thr 


Pro 


Glu 


284 




130 










135 










140 










287 


Lys 


Trp 


Asn 


Glu 


Leu 


Leu 


Glu 


Glu 


Ala 


Leu 


Lys 


Val 


He 


Arg 


Ser 


He 


288 


145 










150 










155 










160 


291 


Asp 


Lys 


Lys 


His 


Thr 


He 


He 


lie 


Gly 


Thr 


Ala 


Glu 


Trp 


Gly 


Gly 


He 


0 0 0 
z y z 










165 










170 










175 




295 


Ser 


Ala 


Leu 


Glu 


Lys 


Leu 


Ser 


Val 


Pro 


Lys 


Trp 


Glu 


Lys 


Asn 


Ser 


He 


296 








180 










185 










190 






299 


Val 


Thr 


lie 


His 


Tyr 


Tyr 


Asn 


Pro 


Phe 


Glu 


Phe 


Thr 


His 


Gin 


Gly 


Ala 


300 






195 










200 










205 








303 


Glu 


Trp 


Val 


Glu 


Gly 


Ser 


Glu' 


Lys 


Trp 


Leu 


Gly Arg 


Lys 


Trp 


Gly 


Ser 


304 




210 










215 










220 










307 


Pro 


Asp 


Asp 


Gin 


Lys 


His 


Leu 


He 


Glu 


Glu 


Phe 


Asn 


Phe 


He 


Glu 


Glu 


308 


225 










230 










235 










240 


311 


Trp 


Ser 


Lys 


Lys 


Asn 


Lys 


Arg 


Pro 


He 


Tyr 


He 


Gly Glu 


Phe 


Gly 


Ala 


312 










245 










250 










255 




315 


Tyr 


Arg 


Lys 


Ala 


Asp 


Leu 


Glu 


Ser 


Arg 


He 


Lys 


Trp 


Thr 


Ser 


Phe 


Val 


316 








260 










265 










270 
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Please Note; 

Use of n and/or Xaa have been detected In the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <22 0> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:223; N Pos . 1759,1760,1761 
Seq#:224; Xaa Pos. 587 
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